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Competing styles of Statistical Mechanics have been introduced as 
practical succedaneous to the conventional well established Boltzmann- 
Gibbs statistical mechanics, when in the use of the latter the researcher 
is impaired in his/her capacity for satisfying the Criteria of Efficiency 
and/or Sufficiency in statistics [Fisher, 1922], that is, a failure in the 
characterization (presence of fractality, scaling, etc.) of the system re- 
lated to some aspect relevant to the given physical situation. To patch 
this limitation on the part of the observer, in order to make predictions 
on the values of observables and response functions, are introduced 
unconventional approaches. We present a detailed description of their 
construction and a clarification of its scope and interpretation. Also, 
resorting to the use of the particular case of Renyi's unconventional 
statistics is built a nonequilibrium ensemble formalism. The uncon- 
ventional distribution functions of fermions and bosons are obtained, 
and in the follow-up article we describe applications to the study of 
experimental results in semiconductor physics and in electro-chemistry 



involving nanometric scales and fractal-like structures, and some ad- 
ditional theoretical analysis is added. 
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1. INTRODUCTION 



More than twenty years ago Montroll and Shlesinger wrote that in the world of 
the investigation of complex phenomena that requires statistical modelling and 
interpretation several competing styles have been emerging, each with its own 
champions [1]. In the intervening years up to this beginning of the 21st century, a 
good amount of effort - with a flood of papers - has been dispensed to the topic. 
What is at play consists in that in the study of certain physico-chemical systems 
we may face difficulties when handling situations involving fractal-like structures, 
correlations (spatial and temporal) with some type of scaling, turbulent or chaotic 
motion, small size (nanometric scale) systems with eventually a low number of 
degrees of freedom, etc. These difficulties consist, as a rule, in that the researcher 
is unable to satisfy Fisher's Criteria of Efficiency and/or Sufficiency [2] in the 
conventional, well established, physically and logically sound Boltzmann-Gibbs 
statistics, meaning an impairment on his/her part, to include the relevant and 
proper characterization of the system. To mend these difficulties, and to be able 
to make predictions (providing an understanding, even partial, of the physics of 
the system but of interest in, for example, analyzing characteristics of devices 
technologically relevant, as illustrated in the follow up article) one may resort to 
alternative statistics other than the Boltzmann-Gibbs one, which are not at all 
extensions of the latter but, as said, introduce a patching method. 

Several approaches do exist and we can mention what can be labelled as Gen- 
eralized Statistical Mechanics (see for example P. T. Landsberg, in Ref. [3]), 
Superstatistics (see for example E. G. D. Cohen and C. Beck in Refs. [4,5]), 
Nonextensive Statistics (see for example the Conference Proceedings in Ref. [6]), 
and some particular cases are statistical mechanics based on Renyi Statistics (see 
for example I. Procaccia in Ref. [7] and T. Arimitzu in Refs. [8, 9]), Kappa (some- 
times called Deformational) statistics (see for example V. M. Vasyliunas in Ref. 
[10] and Kaniadakis in Ref. [11]). A systematization of the subject, accompanied 
of a description of a large number of different possibilities, are described in what 
we have dubbed as Unconventional Statistical Mechanics, whose general theory 
and its discussion is presented in this paper while in the follow up one illustrations 
of its application in several physico-chemical systems are presented. 

We begin noticing that Statistical Mechanics of many-body systems has a long 
and successful history. The introduction of the concept of probability in physics 
originated mainly from the fundamental essay of Laplace [12], who incorporated 
and extended some earlier seminal ideas (see for example [13]). As well known, 



Statistical Mechanics attained the status of a well established discipline at the 
hands of Maxwell, Boltzmann, Gibbs, and others, and went through some steps 
related to changes, not in its fundamental structure, but just on the substrate 
provided by microscopic mechanics. Beginning with classical dynamics, statisti- 
cal mechanics incorporated - as they went appearing in the realm of Physics - 
relativistic dynamics and quantum dynamics. Its application to the case of sys- 
tems in equilibrium proceeded rapidly and with exceptional success: equilibrium 
statistical mechanics gave - starting from the microscopic level - foundations to 
Thermostatics, and the possibility to build a Response Function Theory. Applica- 
tions to nonequilibrium systems began, mainly, with the case of local equilibrium 
in the linear regime following the pioneering work of Lars Onsager [14] (see also 
[15])- 

For systems arbitrarily deviated from equilibrium and governed by nonlinear 
kinetic laws, the derivation of an ensemble-like formalism proceeded at a slower 
pace than in the case of equilibrium, and somewhat cautiously, with a long list of 
distinguished scientists contributing to such development. It can be noticed that 
Statistical Mechanics gained in the fifties an alternative approach sustained on the 
basis of Information Theory [13, 16-23]: It invoked the ideas of Information Theory 
accompanied with ideas of scientific inference [24,25], and a variational principle 
(the latter being Jaynes' principle of maximization of informational uncertainty 
- also referred-to as informational-entropy - and called MaxEnt for short), com- 
pounding from such point of view a theory dubbed as Predictive Statistical Me- 
chanics [13,16-21,26]. It should be noticed that this is not a new paradigm in 
Statistical Physics, but a quite useful and practical variational method which cod- 
ifies the derivation of probability distributions, which can be obtained by either 
heuristic approaches or projection operator techniques [27-29]. It is particularly 
advantageous to build nonequilibrium statistical ensembles, as done here, when it 
systematizes the relevant work on the subject that renowned scientists provided 
along the past century. The informational-based approach is quite successful in 
equilibrium and near equilibrium conditions [16, 17, 22, 23], and in the last decades 
has been, and is being, also applied to the construction of a generalized ensemble 
theory for systems arbitrarily away from equilibrium [28-30]. The nonequilibrium 
statistical ensemble formalism (NESEF for short) provides mechanical-statistical 
foundations to irreversible thermodynamics (in the form of Informational Statisti- 
cal Thermodynamics - 1ST for short [31-34]), a nonlinear quantum kinetic theory 
[28, 29, 35] and a response function theory [29, 36] of a large scope for dealing with 
many-body systems arbitrarily away from equilibrium. NESEF has been applied 



with success to the study of a number of nonequilibrium situations in the physics 
of semiconductors (see for example the review article of Ref. [37]) and polymers 
[38], as well as to studies of complex behavior of boson systems in, for example, 
biopolymers (e.g. Ref. [39]). It can also be noticed that the NESEF-based non- 
linear quantum kinetic theory provides, as particular limiting cases, far-reaching 
generalizations of Boltzmann [40] , Mori (together with statistical foundations for 
Mesoscopic Irreversible Thermodynamics [41]) [42], and Navier-Stokes [43] equa- 
tions and a, say, Informational Higher-Order Hydrodynamics, linear [44] and non- 
linear [45]. 

NESEF is built within the scope of the variational method on the basis of the 
maximization of the informational-entropy in Boltzmann-Gibbs-Shannon-Jaynes 
sense, that is, the average of minus the logarithm of the time-dependent - i.e. 
depending on the irreversible evolution of the macroscopic state of the system - 
nonequilibrium statistical operator. It ought to be emphasized that informational- 
entropy - a concept introduced by Shannon - is in fact the quantity of uncertainty 
of information, and has the role of a generating functional for the derivation of 
probability distributions (for tackling problems in Communication Theory, Physics, 
Mathematical Economics, and so on). There is one and only one situation when 
Shannon- Jaynes informational-entropy coincides with the true physical entropy of 
Clausius in thermodynamics, namely, the case of strict equilibrium [8, 46-49] . For 
short, we shall refer to informational-entropy as infoentropy. As already noticed 
the variational approach produces the well established equilibrium statistical me- 
chanics, and is providing a satisfactory formalism for describing nonequilibrium 
systems in a most general form. This Boltzmann- Gibbs Statistical Mechanics al- 
lows for a proper description of the physics of condensed matter, but in some kind 
of situations, for example, involving nanometric-scale systems with some type or 
other of fractal-like structures or systems with long-range space correlations, or 
particular long-time correlations, it becomes difficult to apply because of a defi- 
ciency in the proper knowledge of the characterization of the states of the system 
in the problem one is considering (at either the microscopic or/and macroscopic or 
mesoscopic level). This is, say, a practical difficulty (a limitation of the researcher) 
in an otherwise extremely successful physical theory. 

In fact, in a classical and fundamental paper of 1922 [2] by R.A.Fisher, titled 
"On the Mathematical Foundations of Theoretical Statistics" , are presented the 
basic criteria that a statistics should satisfy in order to provide valuable results. In 
what regards present day Statistical Mechanics in Physics two of them are of ma- 
jor relevance, namely the Criterion of Efficiency and the Criterion of Sufficiency. 



This is so because of particular constraints that impose recent developments 
in physical situations involving small systems (nanotechnology, nanobiophysics, 
quantum dots and heterostructures in semiconductor devices, one-molecule tran- 
sistors, fractals-electrodes in microbatteries, and so on), where on the one hand 
the number of degrees of freedom entering in the statistics may be small, and on 
the other hand boundary conditions of a fractal-like character are present which 
strongly influence the properties of the system, what makes difficult to introduce 
sufficient information for deriving a proper Boltzmann-Gibbs probability distribu- 
tion. Other cases when sufficiency is difficult to satisfy is the case of large systems 
of fluids whose hydrodynamic motion is beyond the domain of validity of the clas- 
sical standard approach. It is then required the use of a nonlinear higher-order 
hydrodynamics, eventually including correlations and other variances (a typical 
example is the case of turbulent motion). Also we can mention other cases where 
long-range correlations have a relevant role (e.g. velocity distribution in clusters 
of galaxies at a cosmological size, or at a microscopic size the already mentioned 
case of one-molecule transistors where Coulomb interaction between carriers is 
not screened and then of long range creating strong correlations in space with 
problems of scaling). 

Hence, we may say that the proper use of the universal Boltzmann-Gibbs 
statistics is simply impaired because of either a great difficulty to handle the re- 
quired information relevant to the problem in hands, or incapacity on the part 
of the researcher to have a correct access to such information, and consequently, 
out of practical convenience or the force of circumstances, respectively, a way to 
circumvent this inconveniency in such kind of "anomalous" situations, consists 
to resort to the introduction of modified forms of the informational-entropy, that 
is, other than the quite general one of Shannon- Jaynes, the one that leads to 
the well established and physically and logically sound statistics of Boltzmann- 
Gibbs. These modified infoentropies are built in terms of the deficient character- 
ization one does have of the system, and are dependent on parameters - called 
information-entropic indexes, or infoentropic indexes for short with the under- 
standing that refer to the infoentropy . 

We restate the fundamental fact that these infoentropies are generating func- 
tional for the derivation of probabilities distributions, and are not at all to be 
confused with the physical entropy of the system. Recently it has been consid- 
ered the proposition that a particular one among the infinitely-many that can 
be defined -as shown as we proceed - comes to supersede the supposedly more 
restricted one of Boltzmann-Gibbs as the entropy of systems in Nature [50-52]. 



Such "entropy" has the form adapted to Physics of the structural infoentropy 
of Havrda-Charvat of Table II below, which is, we insist, a generating functional 
for deriving heterotypical distributions to patch the difficulties with the universal 
Boltzmann-Gibbs-Shannon-Jaynes one (or measure of Kiillback-Leibler of Table 
I) when we face our (not of the statistics) limitations in satisfying Fisher's crite- 
ria of efficiency and/or sufficiency or, according to Renyi [53] when dealing with 
incomplete information. 

This alternative approach originated in the decades of the 1950's and 1960's at 
the hands of statisticians, being extensively used in different disciplines (economy, 
queueing theory, regional and urban planning, nonlinear spectral analysis, and so 
on). Some approaches were adapted for use in physics, and we present here an 
overall picture leading to what can be called Unconventional Statistical Mechanics 
(USM for short), consisting, as noticed, in a way to patch the lack of knowledge of 
characteristics of the physical system which are relevant for properly determining 
one or other property (see also P. T. Landsberg in Refs. [47] and [3]) impairing 
the correct use of the conventional one. 

A large number of possible infoentropies can be explored, and Peter Landsberg 
quite properly titled an article Entropies Galore! [47]. An infinite family is the 
one that can be derived from Csiszer's general measure of cross-entropy (see for 
example [54]); other family has been proposed by Landsberg [3]; and specific 
informational entropies are, among others, the ones of Skilling [55] - which have 
been used in mathematical economy -, and of Kaniadakis [56] who used it in the 
context of special relativity [11]. They, being generating functionals of probability 
distributions, give rise to particular forms of statistics: the one of next section 
which, as noticed, we have dubbed Unconventional Statistical Mechanics; we do 
also have the so-called Superstatistics proposed by C. Beck and E. G. D. Cohen 
for driven nonequilibrium systems with a stationary state and intensive parameter 
fluctuations [4,5]; what can be called Deformational Statistics [11,56], and other 
approaches could be possible. 

We present here a derivation of USM in terms of unconventional informational- 
entropies. They are related to a family of so-called statistical measures in a metric 
space of statistical distributions, when it is provided a distance of the sought-after 
statistical distribution with a reference distribution: a principle of minimization 
of this distance [MinxEnt for short) is equivalent to the maximization of the asso- 
ciated infoentropy (MaxEnt) [54]. This is discussed in the next section, whereas 
in Section 3 we consider the formulation of a nonequilibrium-statistical ensem- 
ble formalism for far-from-equilibrium systems based on the use of one particular 



unconventional infoentropy, namely the one due to Renyi [57]. In Section 4 we 
derive generalized distribution functions for fermions and bosons, which in Renyi 
statistics enter in place of the standard Fermi-Dirac and Bose-Einstein distribu- 
tions. They are used in the follow up article to analyze experiments in condensed 
matter physics. Finally, Section 5 is devoted to the presentation of some addi- 
tional general remarks and a summary of the results together with some further 
considerations. 

2. INFORMATIONAL ENTROPY OPTIMIZATION PRIN- 
CIPLE 

Use of the variational MaxEnt for building NESEF provides a powerful, practical, 
and soundly-based procedure of a quite broad scope, which is encompassed in what 
is sometimes referred-to as Informational-Entropy Optimization Principles (see for 
example Ref. [54]). To be more precise we should say constrained optimization, 
that is, restricted by the constraints consisting in the available information. Such 
optimization is performed through calculus of variation with Lagrange's method 
for finding the constrained extremum being the preferred one. 

Jaynes' variational method of maximization of the informational-statistical 
entropy is connected - via information theory in Shannon-Brillouin style - to a 
principle of maximization of uncertainty of information. This is the consequence 
of resorting to a principle of scientific objectivity [24, 25], which can be stated as: 
Out of all probability distributions consistent with a given set of constraints, we 
must take the one that has maximum uncertainty. 

As noticed in the Introduction, its use leads to a construction wholly equivalent 
to the one in Gibbs' ensemble formalism, recovering the traditional results in equi- 
librium [16, 17, 22], and allowing for the extension to systems far from equilibrium 
[23,28-30]. 

Jaynes' MaxEnt is a major informational-entropy optimization principle re- 
quiring, as noticed, that we should use only the information which is accessible 
but scrupulously avoiding to use information not proven to be available. This 
is achieved by maximizing the uncertainty that remains after all the given infor- 
mation has been taken care of. However, this maximization of uncertainty can 
be looked at from a different approach. This is the MinxEnt principle, consist- 
ing into, first, to introduce a space of probability distributions and an associated 
metric defining a distance between two probability distributions and, second, a 
referential a priori distribution. According to the principle: Out of all proba- 



bility distributions satisfying the given constraints, choose the one that is closest 
(minimum distance) to the given referential distribution. 
Consequently, to carry this programme we must: 

(1) Introduce a metric considered to be appropriate for the problem in hands; 

(2) To have two types of information, namely, 

(i) information consisting into giving the referential probability distribution, 
what would be based on intuition or experience related to the given problem; 

(ii) information consisting of the constraints, through accessible observation and 
theoretical knowledge. 

The MinxEnt principle can be considered to be based on common sense, as 
it is MaxEnt. In it the distribution that is derived is consistent with the given 
information, but among all that satisfy the given constraints we choose the one 
that is nearest to our intuition and experience. However, if we do not have a priori 
experience or an intuition to guide us, we must choose the uniform distribution 
as the referential one. This is so because we would be satisfying the principle 
of indifference in Logic, adjudicating to each event the same probability because 
doing otherwise we would be introducing information we do not have (we would 
be "playing with a loaded dice"). Introducing as the referential probability the 
uniform one, the probability distribution derived from MinxEnt, i.e., once it is 
defined a proper distance to the one that is minimized subjected to a set of given 
constraints, coincides the probability distribution which is obtained in MaxEnt, 
as shown below. 

The distance d (g \ g r ) between distribution g and the reference distribution 
g r takes the usual definition of being a single-valued, nonnegative, real quantity 
satisfying the properties of invariance by inversion, the triangular inequality, and 
being a convex function of g. 

Let us consider the case when the uniform distribution is taken as the reference 
one, which we call U, and then MinxEnt in terms of U is restated as: Out of all 
probability distributions satisfying given constraints, it is to be taken the one that 
is closest (i.e. at the minimum distance) to the uniform distribution, i.e. d(g\V) 
is minimum under the given constraints, for, of course, a given metric (a given 
d). In other words, for given constraints (information) the optimized - in the 
sense already discussed - distribution is the "nearest" to the uniform distribution 
corresponding to "maximal ignorance": thus the uncertainty is maximized as it 
is also required by MaxEnt. 



But now arises the question of which should be such distance. We begin to 
discuss the one which leads to recover Boltzmann-Gibbs formalism in Shannon- 
Jaynes approach, consisting in the so-called Kullback-Leibler metric, namely [58] 

d KL (g \ U)=Tr{ Q (In g-lnW- 1 )} , (1) 

where we have called W~ x the uniform probabilities corresponding to the physical 
states accessible to the system in the problem under consideration. Hence, 

d KL (Q | U) = In W + Tr{g In g} = In W - S BG , (2) 

with 

S B g = ~Tr {g In g} (3) 

being Boltzmann-Gibbs-Shannon-Jaynes infoentropy for distribution g. 

Evidently, to minimize cIkl(q | U) under given constraints is equivalent to 
maximize Sbg under such constraints, once In W is a constant. Consequently 
Shannon- Jaynes MaxEnt is equivalent to use MinxEnt in terms of Kullback-Leibler 
metric. Moreover, we call the attention to the fact that the set of constraints may 
contain quantities (basic variables) related to correlations (i.e. second order, third 
order, etc. variances) besides additive quantities. 

Jaynes' MaxEnt aims at maximizing uncertainty when subjected to a set of 
constraints which depend on each particular situation (given values of observables 
and theoretical knowledge and some reliable guessing). But uncertainty can be 
a too deep and complex concept for admitting a unique measure under all con- 
ditions. We may face situations where uncertainty can be associated to different 
degrees of fuzziness in data and information. As already noticed, this is a conse- 
quence, in Statistical Mechanics, of a lack of a proper description of the physical 
situation. This corresponds to being violated the Criterion of Sufficiency in the 
characterization of the system ( "the statistics chosen should summarize the whole 
of the relevant information supplied by the sample" ) [2] . This could occur at the 
level of the microscopic dynamics (e.g. lack of knowledge of the proper eigenstates, 
all important in the calculations), or at the level of macroscopic dynamics (e.g. 
when we are forced, because of deficiency of knowledge, to introduce a low-order 
truncation in the higher-order hydrodynamics that the situation may require): 
both situations are illustrated in the follow up article. Hence, in these circum- 
stances it may arise the necessity of introducing alternative kind of measures, 
with the accompanying indexed (or structural) informational-entropies, [infoen- 
tropies for short ) to build statistical descriptions other than the conventional, well 
established and logically sound of Boltzmann-Gibbs. 



Let us consider some cases of particular measures: A large family of measures 
(distances) is the one provided by I. Csiszer [59], namely 

d c (g\g r )=Tr{g$(R)} , (4) 

where R = QQ' 1 , with $ (z) being a twice differentiate convex function of z and 
$(1) =0 (i.e. for g = g r ). Let us specify it for g r = U; then Kullback-Leibler 
measure follows for $ (R) = In R. In Table I we present a few examples of the 
infinitely-many measures that are possible, all for g r = U, as defined by several 
authors, where W' 1 , we recall, is the value of the uniform probability for each 
state, and a, (3 are numerical indexes (called infoentropic indexes). 

Applying MinxEnt to any of these distances we would get the probability 
distribution deemed appropriate for the given problem in hands, namely, the 
conventional one in Kullback-Leibler metric, and others, so-called in Pearsons' 
nomenclature, heterotypical probability distributions. But, as shown in the case 
of the Kullback-Leibler metric such minimizing principle is equivalent to Jaynes 
MaxEnt [cf. Eq. (2)], and similarly it follows that all the cases considered have an 
associated informational statistical entropy (ISE), whose maximization provides 
the corresponding optimal probability distributions. The structural-informational 
entropies corresponding to the measures of Table I, except for multiplicative and 
additive constants, are given in Table II: we recall that they are a quite few among 
the enormous number of possibilities, and which are cross-entropies for which the 
uniform probability distribution has been chosen as the reference one. 

Renyi approach appears to be a particularly convenient one to deal with fractal 
systems as discussed in Ref . [8] , where it is pointed out that predictions obtained 
resorting to the approach of maximization in Shannon- Jaynes approach including 
fractality can be equivalently obtained using Renyi approach ignoring fractality 
(see also follow up article). Renyi ISE has been studied by Takens and Verbitski 
[63], and a variation of it is Hentschel-Procaccia infoentropy [7] (see also the 
contributions of Refs. [64,65]. For the Havrda-Charvat structural a-entropy, one 
akin to the case a = 2 has been considered by I. Prigogine in connection with 
practical and theoretical difficulties with Boltzmann ideas when extending them 
from the dilute gas to dense gases and liquids [66]. Prigogine argues that to 
cope with such situations one would need a statistical expression of entropy that 
depends explicitly on correlations, as is the case of the Havrda-Charvat structural 
a-entropy for a = 2 (also in the case of Renyi infoentropy). 



TABLE I: Special cases of Csiszer's Measure 

Kullback-Leibler [58] { In W + Tr {g In g} 

Havrda-Charvat [60] I a ~ l \ , y , 

! a > (J and 1 

Shanna-Mittal [61] ( ^ Tr " [ T«£ ' ] 1 

LJ [ a > 1, (3 <1 or a < 1, (3>l 

„ . rrfTl f In + In Tr{a a } 

Renyi 57 <^ , /f J 

L 1 I a > and a^l 

Kaour [621 I \nW + ^ [\nTr {g<*} -InTr {q?}] 

KapUI [bA 1 a > 0, /3 > and a ^ (3 



TABLE II: Informational-Statistical Entropies 

Conventional (Universal) ISE 

Boltzmann-Gibbs-Shannon-Jaynes ISE { — Tr {g In g} 
(from Kiilback-Leibler measure) 

Unconventional (entropic-index-dependent) ISEs 

-^Tr^-g} 



From Havrda-Charvat measure 



From Sharma-Mittal measure < a ~P 

a > 1, (3 < 1 or a < 1, /3 > 1 



a > and a^l 

■^Tr { [W a -Pg a ^ +1 - g] g 3 ' 1 } 



From Renyi measure 
From Kapur measure 



__l_l nTr{ ^ } 
a > and a ^ 1 

[In TV {^}- In Tr{^}] 

a > 0, (3 > and a ^ (3 



It can be noticed that taking /3 — 1 reduces Kapur ISE to the one of Renyi, 
and Sharma-Mittal ISE to the one of Havrda-Charvat. Moreover, taking also 
a — 1, is obtained an ISE which is of the form of Boltzmann-Gibbs-Shannon- 
Jaynes ISE. What we do have in these ISE's, or in any other one of the infinitely- 
many which are possible, is that when the adjustment of the parameters (the 
infoentropic indexes) on which they depend - let it be in a calculation or as 
a result of the comparison with the experimental data (see follow-up article) - 
produces Boltzmann-Gibbs result, this gives an indication that the principle of 
sufficiency is being satisfied, i.e., for such particular situation the description of 
the system we are doing includes all the relevant characterization that properly 
determines the physical property that is measured in the given experiment being 
analyzed. The point has also recently been discussed by Nauenberg [51], and it 
is illustrated in the follow-up article: In the insufficient descriptions - as there 
described - the parameter a (as noticed called infoentropic index) is different 
from 1 and depends on each case on the system geometry, boundary conditions, 
mainly its thermodynamic state (in equilibrium or out of it in steady states or 
time-evolving conditions), the experimental protocol, and so on. 

Moreover, we again stress the fundamental fact that the structural informational- 
entropies (quantity of uncertainty of information) are not to be confused with the 
Clausius-Boltzmann physical entropy: There is one and only one case when there 
is an equivalence, consisting of Shannon infoentropy when the system is strictly in 
equilibrium [8,46-48]. Boltzmann-Gibbs-Shannon-Jaynes informational entropy 
and its role in NESEF is extensively discussed in Refs. [29,34,67]. 

It is quite relevant to notice that for each kind of statistical entropy it is 
necessary in an ad hoc manner, to introduce definitions of average values of ob- 
servables with particular forms, what is required to obtain a posteriori consistent 
results. For the case of Kullback-Leibler measure, or Shannon- Jaynes statisti- 
cal informational-entropy, we must use the usual expression, i.e. the average of 
quantity A is given by 



while for the case of Renyi ISE, needs be introduced an average of the form 




(5) 




(6) 



that is, in terms of the so-called escort probability [68,69] 



Va{Q} = Q a /Tr{(f} 



(7) 



which is also the one to be used in the case of Havrda-Charvat statistics. Appar- 
ently, the use of the altered distribution of Eq. (7) - later called escort probability 
- was originally proposed by Renyi: It appears that the motivation behind is that 
the quantity of information using the insufficient description in the unconventional 
approach (incomplete probabilities in Renyi's nomenclature) equals the quantity 
of information using the conventional Shannon expression but in terms of the 
escort probability of Eq. (7) plus the gain in information when one introduces 
T> a in place of the incomplete g [see Chapter IX, p. 569 et seq., in Ref. [53]). 
Generalization of the concept of escort distributions is given by Beck and Schlogl 
(see Chapter 9 in [68]), who have also shown that for the particular case of the 
Renyi measure of order a (see Tables I and II) it follows that 

dT 

(1 - «) 2 = Tr {V a {gj (\nV a {gj - In g a )} , (8) 

where I a is Renyi information function (the negative of Renyi a-dependent entropy 
of Table II), and the right-hand side can be interpreted as the information gain 
when using the escort probability V a built in terms of the original one g a (see 
Chapter 5 in [68]). 

We also call the attention to the fact that the introduction of the escort prob- 
ability of a given distribution g, said incomplete in Renyi's sense (Chapter IX 
pp. 569 et seq. in Ref. [53]), adds to the normal definition of average value 
the presence of second and higher-order variances. In fact, and this is detailed 
in Appendix A, for the average value of an observable A in terms of the escort 
probability of order 7, if we write S — — In g and 7 = 1 + e, it follows that (see 
Appendix A) 

(A) = Tr {Av a [e }} = (A) o + e {(As) o - (A) o (S) J + 
4 {M)„ ~ M Mo + 2 ( A ). " 2 Mo M) + 

+0 (t 3 ) , (9) 

where 

(...)„ = Tr{...Q} , (10) 
that is, the normal average value. 



For illustration let us take for A the Hamiltonian H and a canonical distribu- 
tion q = Z _1 exp j— j, and then up to second order in e Eq. (9) becomes 

2 

£ = ^-H"^ = (-^) o + e P + e —f? A 3 E , (11) 

where 




are the second and third order variances of the energy. 

For specific illustrations see in the follow up paper the case of the ideal gas 
in a finite box, and in next Section the case of ideal quantum gases. Hence, 
complementing what was said previously, the use of the escort probability adds 
"information" through the inclusion of second and higher order particular vari- 
ances. 

We call the attention to the fact that USM is to be based on the use of both defi- 
nitions, namely, the heterotypical probability distribution and the escort probability 
(notice that for probability distributions other than Renyi and Havrda-Charvat 
other definitions of escort probabilities should be introduced). The role of the 
escort probability accompanying the heterotypical-probability distribution is that 
both complement each other in order to redefine, in the sense of weighting, the 
values of the probabilities associated to the physical states of the system; on the 
microscopic level and on the macroscopic level the question is illustrated in the 
follow-up article. 

Of course other possibilities are open, that is, other statistical entropies or sta- 
tistical measures. One attempt is due to W. Ebeling [70, 71] who has addressed 
the question of the statistical treatment of a class of systems that are in some sense 
"anomalous" . They contain those in nature and society which are determined by 
its total history. Usually the given examples are the evolution of the Universe and 
of our planet, phenomena at the biological, ecological, climatic, social levels, etc. 
The approach consists into introducing conditional probabilities in the context of 
Boltzmann-Gibbs formalism in Shannon- Jaynes approach, leading to a general- 
ized statistical entropy appropriate for describing the thermodynamics of complex 
processes with long-ranging memory and including correlations [70-72]; it can be 
referred-to as Ebeling statistics. 



We consider next the formulation of a nonequilibrium ensemble based on the 
particular case of Renyi informational-entropy. 

3. NONEQUILIBRIUM a-DEPENDENT RENYI ENSEM- 
BLE 

For systems away from equilibrium several important points need be carefully 
taken into account in each case under consideration [27,29]: 

(1) The choice of the basic variables (a wholly different choice than in equilib- 
rium when suffices to take a set of those which are constants of motion), which 
is to be based on an analysis of what sort of macroscopic measurements and pro- 
cesses are actually possible, and, moreover, one is to focus attention not only on 
what can be observed but also on the character and expectatives concerning the 
equations of evolution for these variables (e.g. Refs. [73, 74]). We also notice that 
eventhough at the very initial stages we would need to introduce all the observ- 
ables of the system, as time elapses more and more contracted descriptions can 
be used as enters into play Bogoliubov's principle of correlation weakening and 
the accompanying hierarchy of relaxation times [75]. 

(2) It needs be introduced historicity, that is, the idea that it must be incor- 
porated all the past dynamics of the system (or historicity effects), all along the 
time interval going from a starting description of the macrostate of the sample 
in the given experiment, say at t , up to the time t when the measurement is 
performed. This is a quite important point in the case of dissipative systems as 
emphasized among others by John Kirkwood and Hazime Mori: It implies in that 
the history of the system is not merely the series of events in which the system has 
been involved, but it is the series of transformations along time by which the sys- 
tem progressively comes into being at time t (when a measurement is performed), 
through the evolution governed by the laws of mechanics [76, 77]. 

(3) The question of irreversibility (or Eddington's arrow of time) on what 
Rudolf Peierles stated that: "In any theoretical treatment of transport problems, 
it is important to realize at what point the irreversibility has been incorporated. 
If it has not been incorporated, the treatment is wrong. A description of the 
situation which preserves the reversibility in time is bound to give the answer zero 
or infinity for any conductivity. If we do not see clearly where the irreversibility 
is introduced, we do not clearly understand what we are doing" [78]. 

Points (1) to (3) above are discussed in Ref. [29], where it is presented a 
complete description of the construction of ensembles for nonequilibrium systems, 



within the general theory provided by the use of Boltzmann-Gibbs formalism in 
Shannon- Jaynes approach. 

We present next the construction of an unconventional nonequilibrium sta- 
tistical ensemble formalism. First we call the attention to the situation where 
it is applied, namely, the experiment in condensed matter. Consider the most 
general experiment one can think of, namely a sample (the open system of inter- 
est composed of very-many degrees of freedom) subjected to given experimental 
conditions, as it is diagrammatically described in Fig. 1. 

In Fig. 1, the sample is composed of a number of subsystems, <Tj, (or better 
to say subdegrees of freedom, for example, in solid state matter those associated 
to electrons, lattice vibrations, excitons, impurity states, collective excitations as 
plasmons, magnons, etc., hybrid excitations as polarons, polaritons, plasmaritons 
and so on). They interact among themselves via interaction potentials producing 
exchange at certain rates, r^-, of energy and momentum. Pumping sources act 
on the different subsystems of the sample - via particular types of fields, electric, 
magnetic, electromagnetic, etc. - which should of course be very well characterized 
on setting up the experiment, and there follows relaxation of the energy in excess 
of equilibrium to the external reservoirs, Tjr. Finally, the experiment is performed 
coupling an external probing source, characterized in the figure by P (t), with one 
or more subsystems of the sample, and some kind of response, say R (t), is detected 
by a measuring apparatus (e.g. ammeter, spectrometer, etc.) Here the pumping 
sources exert their influence on the given open system through the fields they 
generate, say, magnetic, electric, electromagnetic as produced for example from a 
laser machine, and so on, eventually, in scattering experiments is the interaction 
potential with the particles of an incoming beam. 

Furthermore, for simplicity, in order to avoid a cumbersome description which 
would obscure the presentation of the matter, we restrict the situation to the 
case when it is assumed that the probed subsystem o\ is driven out of equilib- 
rium, while remaining in contact (interaction) with the other subsystems which 
are taken as an ideal thermal bath (their macroscopic states remaining constantly 
in equilibrium with the external reservoirs). According to theory the nonequi- 
librium statistical operator is a superoperator of an auxiliary one dubbed "quasi- 

equilibrium instantaneous frozen" statistical operator, say, 7Z (t, 0) [29, 30]. In the 
conditions stated above it is composed of the product of the one of the subsystem 
under consideration, q (t, 0), times the constant one of the thermal bath and reser- 
voirs (the coupling between the subsystems and with the reservoirs is introduced 
in the construction of the nonequilibrium statistical operator shown below). 



We concentrate the attention on the statistical operator of the subsystem of 
interest - from now on simply called the system -, and then once the auxiliary 
operator g (t, 0) is given, we can built the nonequilibrium statistical operator, say 
g e (t), which can be given in either of two equivalent forms, one being ( [77, 79]) 

,„,./«,.-,„,-„ . 

— oo 

where 

g(t',t'-t) = exp^~{t , -t)H^g(lf,0)exp^(t'-t)H^ , (15) 

g(t,0) = exp{-S(t,0)} (16) 

and 

n „ 

S (t, 0) = (t) + / d'r F 3 (r, t) P 3 (r) (17) 

is the so-called informational-statistical-entropy operator which is extensively dis- 
cussed in Ref. [80]. In these expressions, H is the system Hamiltonian and 

|Pj (r) j , j = 1,2, constitutes the set of basic dynamical variables describing 

the nonequilibrium macroscopic state of the system, with the average values of 
them - in terms of the distribution of Eq. (14) - constituting the set {Qj (r, t)} 
of basic macrovariables in the nonequilibrium thermodynamic state of the sys- 
tem [34]. In Eq. (17), {Fj (r,t)} , j = 1,2, is the set of Lagrange multipliers 
(intensive nonequilibrium thermodynamic variables [34,81] that the variational 
procedure introduces), and <j) (t) ensures the normalization of the distribution and 
can be considered as being the logarithm of a nonequilibrium partition function, 
i.e. (t) =lnZ (t). Finally, e exp {e (t' — t)} is Abel's kernel (in the theory of con- 
vergence of integral transforms), with e being a positive infinitesimal which goes 
to zero after the calculation of averages have been performed. This introduces 
the concept of Bogoliubov's quasiaverages [82], and leads to irreversible evolution 
from an initial condition, what it does by selecting the retarded solutions of the 
Liouville equation that g satisfies, i.e. the advanced solutions are discarded in a 
quite similar way as done by Gell-Mann and Goldberger in the case of Schrodinger 
equation in scattering theory [83]. 



Equation (14) can be rewritten, after integration by parts in time, as 

Q e (t) = Q(t,0) + g' e (t) , (18) 

where g (t, 0) is given in Eq. (16) and 

t 

m = - J dt'e^'-^H^t'-t) . (19) 

— oo 

According to Eq. (18), the proper statistical operator g e is composed of two con- 
tributions, namely g which is the so-called "instantaneously frozen" contribution 
of Eq. (16) and g' e which is responsible for the description of the irreversible 
evolution of the system, and it is the contribution that introduces historicity in 
the theory. Some confusion sometimes occurs when some authors use g as the 
proper statistical operator: This auxiliary distribution, (i) does not satisfy Liou- 
ville equation, (ii) does not describe the dissipative processes that develop in the 
system, (Hi) does not provide the correct kinetic theory for the description of the 
dissipative processes that are unfolding in the medium, (iv) does not give the cor- 
rect values of observables, other than those corresponding to the basic variables; 
this also applies to the case of steady states. We also call the attention to the fact 
that care must be exercised on the question of separating the state of the system 
from the one of the reservoirs [29]. Finally, we recall the important result that for 
the basic variables, and only for the basic variables, there follows that [28-30] 

Q j (r, t) = Tr [P j (r) g e (t)} = Tr [P j (r) g (t, 0)} . (20) 

Let us now consider the case of Renyi informational entropy, i.e. 

S tt (t) = --±-\nTr{[Q a (t,0)] a } ! (21) 
a — 1 

we notice that a recent application of Renyi's statistics for dealing with (multi) fractal 
systems is presented by Jizba and Arimitzu [8] : There it is addressed the question 
on how Renyi's approach appears as a quite convenient one in such cases. Further 
considerations on Renyi's approach can be consulted in the articles by Hentschel 
and Procaccia [7] and Takens and Verbitski [63]. We first proceed to find the 
"instantaneously frozen" auxiliary distribution, by maximizing S a subjected to 
the conditions of normalization 



Tr{g a (t,0)} = l 



(22) 



and the constraints consisting of the average values, as defined by Eq. (6), of the 
basic dynamical variables, namely 



Q j (T,t)=Tr\P j (r) V a {g(t,0)} 



(23) 



where 



T>aW,0)} = [Q a (t,0)] a /Tr{[Q a (t,0)] a } (24) 
is the corresponding escort probability [68, 69] (cf. discussion after Eq. (7) above). 
It follows that (see Appendix B) 



Qa M 



Va (t) 



1 + (a - 1) ^ / d 3 r F ja (r, t) APj (r, f) 



where 

APj (r, t) = Pj (r) - Q j (r, t) 
with Qj (r, t) given in Eq. (23), 



Va (t) = Tr 



1 + (a - 1) / d " r Fja (r, t) APj (r, t) 



(25) 



(26) 



(27) 



ensures the normalization condition, and Fj a are the Lagrange multipliers that the 
variational method introduces, which are related to the basic variables through 
Eq. (23). 

In terms of the auxiliary g a , the statistical distribution is given by [cf. Eqs. 
(14) and (16)] 

t 

Qae(t)=e f dt'e^g^t'J-t) , (28) 

— oo 

where, we recall, 

g a (t', t'-t) = exp j-1 (t' -t)Hy ea (?, 0) exp jl (*'-*) if j , (29) 
The statistical distribution of Eq. (28) satisfies the Liouville equation 

-e[Qae(t)-Qa(tM , (30) 



with the presence of the infinitesimal source introducing Bogoliubov's symmetry 
breaking procedure (quasiaverages), in the present case the one of time reversal 
(as already noticed in that way are discarded the advanced solutions of the full 
Liouville equation). Thus, the retarded solutions have been selected, and, a pos- 
teriori, this is transmitted to the kinetic equations producing a fading memory 
and irreversible behavior (cf. Refs. [29,35]). 

We also call the attention to the fact that for average values, as given by Eq. 
(6), we then have 

(A)=Tr{AD ae {Q ae (t)}} , (31) 

where 

Vc*{Qa e (t)} = fi e (t)/Tr{fi e (t)} , (32) 

and it is implicit the limit e — > after the calculation of traces has been performed. 

Because of the boundary condition g^ e (t Q ) = [g a (t , 0)] a (t — > — oo), we have 
that 

V ae {g ae {t )} =V a {g a {t o ,0)}, where V a is given by Eq. (24). For e -> 0, 
g a£ satisfies a true Liouville equation [cf. Eq. (30)], and so does V ae , and we re- 
call that the infinitesimal source on the right-hand side of Eq. (30) is selecting the 
retarded solutions of the true Liouville equation (via, then, Bogoliubov's method 
of quasiaverages, as previously noticed). Hence, for the given initial condition and 
the imposition of discarding the advanced solutions, V ae {g ae (t)} also satisfies a 
modified Liouville equation, and we can write 

V ae {g ae (f)} =V a {Q a (t, 0)} + V' ae (t) , (33) 

where V a {g a (t, 0)} is given by Eq. (24), and 

t 

Ke (t) = ~ J dt'e^'-^ V a {g a (f, t'-t)} . (34) 

— oo 

Introducing Eq. (33) into Eq. (31), we can see that the averages are com- 
posed of an "instantaneously frozen" (at time t) contribution, plus a contribution 
associated to the irreversible processes and including historicity. For the basic 
dynamical quantities, and only for them [cf. Eq. (20)], it follows that 

Qi (r, t) = Tr [P V ae {g ae (*)}} = Tr {/>■ V a {g a (t, 0)}| . (35) 



with, as already noticed, being implicit the limit of e going to +0 to be taken after 
the calculation of the trace operation has been performed. 

After the nonequilibrium distribution using an heterotypical index-dependent 
informational-entropy has been derived, next step - like done in the conven- 
tional case [29, 34-36, 67, 81] - should consists in deriving for arbitrarily far-from- 
equilibrium systems, a nonlinear quantum kinetic theory, a response function the- 
ory, and, of course, a systematic study of experimental results, that is, a full collec- 
tion of measurements of diverse properties of the system, amenable to be studied in 
terms of structural (infoentropic-index dependent) informational-entropies, what 
is fundamental for the validation of the theory (some examples are presented in 
the follow-up article). 

In the next section we derive the corresponding unconventional distributions 
for free fermions and bosons in far-from-equilibrium conditions, which are always 
present in the calculations of physical properties and response functions (see follow 
up article). 

Closing this Section we recall that the previous analysis was done on the basis 
of considering a subsystem of the sample as out of equilibrium, but keeping the 
rest (so-called thermal bath) in constant equilibrium (or near equilibrium) with 
the reservoirs. For an unconventional nonequilibrium statistical mechanics, say 
in Renyi's approach, without the restriction we would need to write the auxiliary 
"quasi-equilibrium statistical frozen" operator as a product involving those of 

each and all the n subsystems, namely K (t) = Q ai (t) ® ••• ® &»„(*) ® QR eS ervoirs, 
adjudicating an infoentropic index aj, j = 1,2, ...,n to each subsystem. 



4. UNCONVENTIONAL DISTRIBUTIONS OF INDIVID- 
UAL 

FERMIONS AND BOSONS 



Let us consider the auxiliary "instantaneously frozen" nonequilibrium statistical 
operator of Eq. (25). After some straightforward mathematical manipulations it 
follows that it can be rewritten in a more convenient form for performing calcula- 
tions, namely, for the homogeneous case (i.e. neglecting dependence on the space 
variables) 




(36) 



where 



V a (*) = Tr 



Fj a (t) — Fj a (t) 



l-(a-l)J2 F ma (t) Qm (t) 



(37) 



(38) 



Equation (37) stands for a modified form of the quantity that ensures the normal- 
ization condition, and Eq. (38) for redefined Lagrange multipliers. 

We proceed next to derive the distribution functions for fermions and for 
bosons using USM in terms of Renyi structural statistical approach. We choose 
as basic dynamical variables, i.e. the Pj, the set of occupation number operators 



(39) 



where c (c^) are the usual annihilation (creation) operators in states |k), satisfy- 
ing the corresponding commutation and anticommutation rules of, respectively, 
bosons and fermions (the spin index is ignored). Their average values are the 
infoentropic-index a-dependent distribution functions 



/k (t) = Tr {c[c k V ae {g ae (*)}} = Tr jc^ V a {g a (t, 



0)} 



(40) 



where we have used Eq. (35) valid for the basic variables. The auxiliary statistical 
operator is then [cf. Eq. (36)] 



Qa (*,0) 



l + (a-l)£ Ka (t) c{c k 



(41) 



with [cf. Eq. (38)] 

Fka (t) = Fk a (t) 



(42) 



The populations of Eq. (40), according to the calculation described in Ap- 
pendix C, take the form 



/k(t) = /k(*)+C k (*) 



(43) 



where 

k (t) = -n , (44) 

1 + (a - 1) F ka (t) 



a-l ±1 



where upper plus sign stands for fermions, and the lower minus sign for bosons, 
and 

C k (t) = a (1 - a) (1 - f k (t)) F *<* (*) F k , a (t) Tr ^c{c k c{ lCk , V a {g (t, 0)}|+..., 

(45) 

involving two, three, etc. particle correlations, which in general are minor correc- 
tions to the first, and main, contribution, the one given by Eq. (44). 

In the limit of a going to 1, which applies when the criteria of efficiency and/or 
sufficiency is satisfied, Renyi statistical entropy acquires the form of Boltzmann- 
Gibbs-Shannon-Jaynes one, C becomes null, F ka (t) becomes F k (t), and then 

AM = PE5F±T ' (46) 

(In equilibrium F k (t) — > (e k — fj) /k B T and there follows the traditional Fermi- 
Dirac and Bose-Einstein distributions). 

We can see that the distribution of Eq. (43) is composed of a term / corre- 
sponding to the individual particle in state |k), plus the contribution C containing 
correlations (of order two, three, etc.) among the individual particles. This type 
of calculation but for systems in equilibrium, and not using the average value 
defined in Eq. (6), in terms of the escort probability, was reported in Ref. [84]. 

Let us now give some attention to the Lagrange multipliers F ka (t). The most 
general statistical operator for nonequilibrium systems can be expressed in the 
form of a generalized nonequilibrium grand-canonical statistical operator for a 
system of individual quasiparticles, where the basic variables are independent 
linear combinations of the single-quasiparticle occupation number operators [cf. 
Eq. (39)], consisting of the energy and particle densities and their fluxes of all 
order [29,42,85,86]. In this description we have that (see also Section 4 and 
Appendix B in the follow up article) 

F ka (t) = P a (t) [e k - fi a (*)] - v ha (t) ■ e k u (k) - v noi (t) • u (k) - 

" E [^S (*) ® £k« [r] (k) + FM (t) ® «M (k)] , (47) 



r>2 



where has been introduced the quantities (3 (t) = l/k B T* (£), playing the role of 
a reciprocal of a quasitemperature [87,88], fi a (t) is a quasi-chemical potential, 
u ha (t)and i> na (t) are vectors, and F^ and F„a r-th rank tensors. Moreover, 

u [r] (k) = [u (k) ... (r - times) ...u (k)] , (48) 

is the tensorial product of r-times the characteristic velocity u (k) = ^.^Vkek) 
where ey_ is the energy dispersion relation of the single-particle, and then u (k) is 
the group velocity in state |k). Dot stands as usual for scalar product of vectors, 
and <S> for fully contracted product of tensors. 

To better illustrate the matter, we introduce a simplified description, or better 
to say a quite truncated description, proceeding to neglect in Eq. (47) all the 
contributions arising out of the fluxes, i.e. we put v = and F^ = 0, retaining 
only the first term on the right-hand side. Therefore, we do have that 

7k (*) = , (49) 



a-l 



± 1 



l + (a-l)P a (t) [e k -// Q (*)] 
where 

K (*) =0 a (t)/{l-( a -l) (3 a (t) [E [t) - p a (t) N (t)]} • (50) 
In this Eq. (50) E (t) is the energy 

£(0-J>k/k(*) , (51) 

k 

and N the number of particles 

N(t)^ /k (t) , (52) 

k 

where the correlations C in Eq. (43) have been neglected. Moreover, in many 
cases we can use an approximate expression for the populations, that is, in the 
one of Eq. (44) we admit that ±1 can be neglected in comparison with the other 
term. This is considered as taking a statistical nondegenerate limit, once, if we 
put a going to 1 (what, we again stress, strictly corresponds to the situation 
when the principle of sufficiency is satisfied), the population takes the form of a 



Maxwell-Boltzmann distribution with quasitemperature T* (t) at time t. In this 
condition the expression for the population can be written as 



where 



and 



h{t) = A a (t)[l + (a-l)B a (t) e k ] 



A a (t)= I -{a- 1) P a (t) n a (t) 



B a (t)=P a (t)/ l-(a-l)P a (t)ii a (t) 



(53) 

(54) 
(55) 



Consider a parabolic dispersion relation, that is, ek = H 2 k 2 /2m*. Using Eq. 
(53) in Eqs. (51) and (52), we arrive at the result that 



n(t) 



e(t) 



N(t) 
V 

E(t) 
V 



A T (0 ^ J 1/2 (a) 



n(t) 



'3/2 



la) 



k B T a (t) 



(56) 



(57) 



h/2 (a) 

with the integrals I u (a) shown in Appendix D, and we have introduced the defi- 
nition 

B- 1 (t) = k B T a (t) , (58) 

where T plays the role of a pseudotemperature and where \ a in Eq. (56) is a 
characteristic length given by \ 2 a (t) = H 2 /m*k B T a (t) (that is, de Broglie wave 
length for a particle of mass m* and energy k B T a (£))• 

We can see that the above Eqs. (56) and (57) define the Lagrange multipliers, 
$ a (t) and \i a (£), present in A a (£), \ a (t), and B a (t), in terms of the basic vari- 
ables energy and number of particles. Moreover, using Eq. (56) we can obtain 
an expression for the quasi-chemical potential in terms of quasitemperature and 
density, namely 



1 - (« - 1) K (t) ii a (t) = [An 2 \l (t) /h /2 (a)] ^ [n (t)}^ 



(59) 



Also, it can be noticed that for a—1 (provided that the condition of sufficiency 
is satisfied) one recovers the equivalent of the results of conventional nonequilib- 
rium statistical mechanics [29,34], which are 



e(t) = -n (t) k B T* (t) 



(60) 



where we have introduced the so-called quasitemperature [29,34,87], defined by 
ksT* (t) = (t), this equation standing for a kind of equipartition of energy 
at time t, and 

H (t) = -fi a k B T* (t) In [T* (t) /9 tr (t)\ , (61) 

where 9 tr (t) = H 2 n 2 ^ 3 (t) /2m* is the characteristic temperature (here in nonequi- 
librium conditions and at time i) for translational motion. This suggests us to de- 
fine a so-called "kinetic temperature" 6^ (t) [89] by equating e (t) to (3/2) n (i) k B QK (t), 
given, after Eq. (57) is used, by 

Q K (t) = T a (t) I (5 - 3a) , (62) 

where we can see that a must be smaller than 5/3, as shown in the follow up 
article where connection of theory with experiment is presented, together with 
other illustrations and discussions. 

How does the a-dependent distribution of Eq. (49) compares with the usual 
Fermi-Dirac and Bose-Einstein distributions? For illustration we consider the 
nondegenerate limit of Eq. (53), common to both, where parameter B is related 
to the kinetic temperature 6^ by Eqs. (58) and (62). Taking for T a of Eq. (58) 
the unique value of 300.fr, we do find in Figures 2 and 3 a comparison of the 
population of Eq. (53) corresponding to several values of the infoentropic index 
a. It can be noticed the characteristic of a different weighting of the values of 
the standard distribution (a ~ 1), such that: (1) for a < 1 the population of 
the modes at low energies are increased at the expense of those of higher energies 
(e > 7 x lO^eV), while (2) for a > 1 we can see the opposite behavior. 

5. COMMENTS AND CONCLUDING REMARKS 

Summarizing, we first notice the relevant point that in the construction of a sta- 
tistical mechanics, the derivation of an appropriate (for the problem in hands) 
probability distribution - associated to a set of constraints imposed on the system 
- can be obtained in a compact and practical way by means of optimization- 
variational principles in a context related to information theory. These are meth- 
ods of maximization of the so-called informational-entropies (better called quan- 
tities of uncertainty of information) or minimization of distances in a space of 
probability distributions (MaxEnt and MinxEnt respectively). 

In the original formulation of Shannon and Jaynes use was made of Boltzmann- 
Gibbs statistical-entropy, which in MaxEnt provides the canonical-like (exponen- 
tial) distributions of classical, relativistic, and quantum statistical mechanics. In 



Ref. [29] it is described its use for the case of many-body systems arbitrarily far 
removed from equilibrium, and the discussion of the dissipative phenomena that 
unfold in such conditions (mainly ultrafast relaxation processes; see Ref. [37]). 
These statistical distributions also follow from MinxEnt once we use Kullback- 
Leibler measure with the uniform probability as the referential one in the defini- 
tion of the corresponding distance. 

This approach has been exceedingly successful in conditions of equilibrium, and 
is a very promising one for nonequilibrium conditions. To have a reliable statistical 
theory in these situations is highly desirable since in very many situations - as for 
example are the case of electronic and optoelectronic devices, chemical reactors, 
fluid motion, and so on - the system is working in far-from-equilibrium conditions. 

However the enormous success and large application of Shannon- Jaynes method 

to 

Laplace-Maxwell-Boltzmann-Gibbs statistical foundations of physics, as it has 
been noticed, some cases look as difficult to be properly handled within the 
Boltzmann-Gibbs formulation, as a result of existing some kind of fuzziness in 
data or information, that is, the presence of a condition of insufficiency in the 
characterization of the (microscopic and/either macroscopic or mesoscopic) state 
of the system. Such, say, difficulty with the proper characterization of the system 
in the problem in hands, (which is a practical one and, we stress, not intrinsic 
to the most general and complete Boltzmann-Gibbs formalism) can be, as shown, 
patched with the introduction of peculiar parameter-dependent alternative struc- 
tural informational-entropies (see Table II). 

Particularly, to deal with systems with some kind of fractal-like structure the 
use of Boltzmann-Gibbs-Shannon- Jaynes infoentropy would require to introduce 
as information the highly correlated conditions that are in that case present. Two 
examples in condensed matter physics (described in the follow up article) are 
"anomalous" diffusion [90] and "anomalous" optical spectroscopy [91], when frac- 
tality enters via the non-smooth topography of the boundary surfaces which have 
large influence on phenomena occurring in constrained geometries (nanometer 
scales in the active region of the sample). In the conventional and more general 
approach, the spatial correlations that the granular boundary conditions introduce 
need be given as information (to satisfy the criterion of sufficiency, since they are 
quite relevant for determining the behavior of the system in the nanometric scales 
involved), but to handle them is generally a nonfeasible task. For example, in the 
second case above mentioned one has no easy access to the determination of the 
detailed topography of the surfaces which limit the active region of the sample (the 



nanometric quantum wells in semiconductor heterostructures) , what can be done 
in the first case using atomic-force microscopy and the determination of the fractal 
dimension involved is possible. Hence the most general and complete Boltzmann- 
Gibbs formalism in Shannon- Jaynes approach becomes hampered out and is dif- 
ficult to handle, and then, as shown, use of other types of informational-entropies 
(better called generating functionals for deriving probability distributions) may 
help to circumvent such inconveniency by introducing alternative algorithms (de- 
pendent on the so-called informational-entropic indexes), that is, the derivation of 
heterotypical probability distributions on the basis of the constrained maximiza- 
tion of unconventional informational-statistical entropies (quantity of uncertainty 
of information), to be accompanied, as noticed in the main text, with the use of 
the so-called escort probabilities. 

Summarizing, Unconventional Statistical Mechanics consists of two steps: 1. 
The choice of a deemed appropriate structural informational- entropy for generat- 
ing the heterotypical statistical operator, and 2. The use of a escort probability in 
terms of the heterotypical distribution of item 1. 

As shown in the main text, and illustrated in the follow-up article, the escort 
probability introduces corrections to the insufficient description by including corre- 
lations and higher-order variances of the observables involved. On the other hand, 
the heterotypical distribution introduces corrections to the insufficient description 
(or incomplete probabilities in Renyi's nomenclature) by modifying the statistical 
weight of the dynamical states of the conventional approach involved in the situa- 
tion under consideration. Moreover, we have considered a particular case, namely 
the statistics as derived from the use of Renyi informational entropy (also used in 
the analysis of the experiments described in the follow-up article). We centered 
the attention on the derivation of an Unconventional Statistical Mechanics appro- 
priate for dealing with far-removed-from-equilibrium systems. Moreover, we have 
reported the calculation, in such conditions, of the distribution functions of sin- 
gle fermions and bosons, the counterpart in these unconventional statistics of the 
usual Fermi-Dirac and Bose-Einstein distributions, which are used and the results 
compared with experimental data in the follow-up article. These distributions are 
illustrated in Figs. 2 and 3. 

In conclusion, we may say that USM appears as a valuable approach, in which 
the introduction of informational-entropic-indexes-dependent informational-entropies 
leads to a particularly convenient and sophisticated tool for fitting theory to ex- 
perimental data for certain classes of physical systems, for which the criterion of 
sufficiency in its characterization cannot be properly satisfied. Among them we 



can pinpoint fractal-like structured nanometric-scale systems, which, otherwise, 
would be difficult to deal with within the framework of the conventional Statistical 
Mechanics. While in the latter case one would need to have a detailed description 
of the spatial characteristics of the structure of the system, the unconventional 
one needs to pay the price of having an open adjustable index to be fixed by 
best fitting with experimental results. It is relevant to notice the fact that the 
infoentropic index(es) is(are) dependent on the dynamics involved, the system's 
geometry and dimensions, boundary conditions, its macroscopic-thermodynamic 
state (in equilibrium, or out of it when becomes a function of time), and the 
experimental protocol. 

Finally, we call the attention to the fact that we have presented several alter- 
natives of cross-entropies (see Table II), for which, as stated in the main text, 
the uniform probability distribution is taken as the reference one, and such gen- 
erating functionals provide a corresponding family of heterotypical probability 
distributions. However, other choices of the reference probability can be made 
and then we have at our disposal very-many possibilities: It is tempting to look 
for the construction of a theory using for the probability of reference, instead of 
the uniform distribution, Shannon- Jaynes informational-entropy in its incomplete 
formalism, that is, when suffering from the deficiency that the researcher cannot 
satisfy Fisher's criteria of efficiency and/ or sufficiency. 
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Appendix A. The escort probability 



Let us consider the probability distribution g and construct the associated escort 
probability of order 7 

V 1 {q} = qVTt{q^ . (A.l) 

We write 7 = 1 + e and proceed with a series expansion of £> 7 around the value 
7 = 1, to obtain, on the one hand 

J2 



Q 1 = Q 



l + eS+-SS + ... 



and 



Tr {g 1 } = 1 + eTr j^j + ^Tr {gSs} 
where we have introduced the nomenclature 

S = -lng 



+ ... 



(A.2) 



(A.3) 



(A.4) 



Using these results, given any observable A its average value in terms of the 
escort probability is given by 



+ - 



A)=Tr[AV 1 {g}] = 
= Tr jipj + e [Tr ji^} - Tr ji^j Tr {§g} 
Tr {ASSg} - Tr Tr {sSg} + Tr {Ag 

-Tr^ASg^Tr^Sg} 



Tr 



Sg) 



(A.5) 



For illustration, let g be the auxiliary nonequilibrium statistical operator of 
Eq. (16), that is 



Q(t,0)=exp\-<l>(t)-Y, F j(t)Pj 

3=1 



(A.6) 



and the average of any of the basic observables, say P m [cf. Eq. (20)] in terms of 
the associated escort probability, given by 



Qrn (t) 



P,„ ! t) =Tr{P m g(t,0)} 



+ 



+e F o (*) [Tr [P m P 3 g (t, 0)} - Tr jp m g (t, 0)} Tr [Pj g (t, 0)}] = ... 

3=1 

(A.7) 

In terms of Renyi probability distribution, when the order of the escort prob- 
ability is to be chosen as equal to the infoentropic index, that is V a {g a (^0)} 
we do have the result of Eq. (A.7) but where g a (t, 0) enters as the probability 
g(t,0). 

Appendix B. Derivation in MaxEnt of Eq. (18) 

Given the constraints of Eqs. (22) and (23), with V a (t,0) defined in Eq. (24), 
and the statistical a-entropy of Eq. (21), according to Lagrange method we look 
for a maximum of the functional 

I (Q) = -— — -r ^ Tr {[g a (t, 0)f} + (t) Tr {g a (t, 0)} - 
a — 1 

dr 3 F ja (T,t)Tr{p j (r)V a (t,0)} , (B.l) 

where and Fj a are the corresponding Lagrange multipliers. The variational 
differential of I for a variation Sg a is given by 

H (g) a [gqft.O)] '" 1 , , (f) 

$Qa " a-lTr{[g a (t,0)] a } U 

~ E Tr{[g a \t,0)] a } I t] ^ ^ (t ' ' (R2) 



3 

where 



AP, (r, t) = P j (r) - Tr jp, (r) V a (t, 0) } 



= P j (r)-Q j (T,t) . (B.3) 
Making null Eq. (B.2) it follows that 



(a-l)<f>(t)Tr{[g a (t,0)] a }/a 



[g a (t, 0)] a_1 = ^ u ^ av ' /J s r , (B.4) 

1 + (a - 1) £ / dr3F ia (r, f ) APj (r, f) 



which can be written in the form 



5 « ((l0) = O) 



1 + 



(«-i)E / 



d 3 rF ja (r,t) APj(r,t) 



i 

" a-l 



(B.5) 



where 



f) a (f) = J dF 1 + (a - 1) J d3 r F i« (r, *) Ai>- (r, t) 



(B.6) 



ensures the normalization of g a and we have the expressions of Eqs. (25) and (27). 
We recall, and stress, that g a (t, 0) of Eq. (B.5) is an auxiliary operator, with the 
proper statistical operator resulting as a functional of this one once historicity is 
introduced, as indicated in Eq. (28). 

Appendix C. Calculation of Distribution Functions 

To proceed with the calculation of /k (t) of Eq. (40) we first write 

7v{4c k [g a ] a } = Tr {[QaT 4 [g a r a [QcT c k} =Tr|c k ([g a ] a c[ [gj'^ [g a ] 

(C.l) 

where g a is given by Eq. (28). We define 

A = (a - 1) J2 ^c k c k 



B = c{ 



and use that [92] 



where 



; v — a/ (1 — a) 
(l + i)" = l + J> ni > 



1 



TV. 



v(v-l) ... (u-n + 1) 



(C.2) 
(C.3) 
(C.4) 

(C.5) 



considering the eigenvalues of A as being smaller than 1 to ensure the convergence. 
Then, after some lengthy but straightforward calculations we find than 



[QoF 4 [eJ- a =(i + A) u B (i + a)~ u 



B + an 



A,B 



+ a 2v 



A. 



A,B 



+ ... + (a 2v -a 2 - v ) 



A,B 



A + . 



which, on account that, 



A, B 



= XB 



A, A,B 



\ 2 B 



, (C.6) 



(C.7) 



where A = — (1 — a) F^, can be rewritten as 



[qX 4 [g a ] a = 



1 + (a - 1) F k 



a-l | 



cl - iV k 



with 



iV k = a (a - 1) **** c[ 4 c k - + ... 



(C.8) 



(C.9) 



being a series composed of terms involving three, four, etc., single-particle creation 
annihilation operators,. Using Eq. (C.8) in Eq. (40) there follows Eq. (44), after 
taking into account that CkC^ = 1 =F c^Ck; (-) for fermions and (+) for bosons 
respectively. 

Appendix D. The Beta Functions of Eqs. (49) and (50) 

The functions of the parameter a of Eqs. (56) and (57) 

oo 

'.<«) = /**" H + <«-!)]* CD.1) 



are of the family of the so-called Beta functions, which are [92] 

oo 1 

B(x,y) = J dt {t ^ r+y = j dtt x - 1 {l-t)^ 1 = Y{x)Y{y)/Y{x + y) . 



(D.2) 

Using Eq. (D.2), after some handling, we find for I X j 2 (a) and I 3 / 2 (a) that for 
a > 1 

i r (3/2) r (sl. - 1) 

h/2 (a) = — K ' 2> , (D.3) 



[a 



- if 2 r fe) 



with the restriction 1 < a < 3, 

with the restriction 1 < a < 5/3. 

Using the property that T (z + \)—z V (z) it follows Eq. (57). 
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Figure 1: Diagramatic description of a typical pump-probe experiment in an 
open dissipative system. 
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Figure 2: The distribution of Eq. (53) for a kinetic temperature of 300K and 
values of Renyi's infoentropic-index a smaller than 1. 




CL 0.0-1 1 1 1 1 1 1 1 1 1 1 — 

0.000 0.002 0.004 0.006 0.008 0.010 



Energy e (eV) 



Figure 3: The distribution of Eq. (53) for a kinetic temperature of 300K and 
values of Renyi's infoentropic-index a larger than 1. 



